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Blunder  detection  is  a  key  step  in  achieving  optimal 
results  from  a  least  squares  adjustment.  One  approach  to 
blunder  detection  is  to  employ  a  detection  algorithm  in  a 
sequential  least  squares  adjustment.  A  sequential  adjustment 
technique  compares  the  variability  of  the  observations  added 
sequentially  to  that  of  the  observations  already  incorpo¬ 
rated  into  the  solution.  The  rejection  criterion  for  obser¬ 
vations  is  based  upon  an  F-statistic.  The  F-statistic  is 
formed  by  the  ratio  of  key,  independent  quadratic  forms  of 
residuals  corresponding  to  the  observations  added  sequen¬ 
tially  and  those  already  accepted  in  the  solution.  Neither 
an  a  priori  nor  an  a  posteriori  reference  variance  is 
required  for  the  blunder  detection  algorithm.  The  sequential 
algorithm  is  computationally  efficient  because  it  combines 
the  least  squares  adjustment  and  the  blunder  detection  into 
one  step.  The  sequential  least  squares  blunder  detection 
algorithm  is  programmable  and  is  tested  on  measured  data 
sampled  from  a  simple  model,  a  straight  line. 
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Problem  Definition 


1.  1 

Blunder  detection  is  a  key  step  in  achieving  optimal 
results  from  a  least  squares  adjustment.  For  the  purpose  of 
this  paper,  a  blunder  is  any  gross  error  in  the  observation 
data  or  adjustment  procedure  that  causes  the  adjustment  to  be 
invalid  for  its  intended  purpose.  Using  this  definition, 
blunder  detection  can  be  considered  as  a  measure  of  quality 
control  or  reliability  of  the  adjustment  results.  Modem 
developments  in  blunder  detection  began  with  Baarda  ,  who 
developed  a  method  called  data  snooping  for  detecting  gross 
errors  in  geodetic  nets.  ( Baarda,  1967)  Baarda  treats  gross 
error  detection  as  a  statistical  reliability  problem  and  he 
offers  several  objective  measures  for  quality  of  results 
obtained  from  least  squares  adjustments.  Many  have  refined 
his  techniques  and  have  adapted  them  to  aerotriangulation. 
Pope  also  developed  a  theoretical  basis  for  detecting  bad 
data  using  the  "Tau"  criterion  on  the  results  of  simultaneous 
least  squares  adjustments  of  triangulation  networks.  (Pope, 
1976)  These  methods  provide  a  more  sound  basis  for  data 
rejection  than  was  previously  available  and  have  provided 
many  new  topics  for  study.  One  such  topic  is  the  application 


2 

of  statistically  rigorous  gross  error  detection  methods  to 
sequential  least  squares  adjustments. 

There  are  many  advantages  in  developing  a  blunder 
detection  method  incorporated  in  the  sequential  least  squares 
adjustment  technique. 

1.  Sequential  techniques  permit  deletion  of  bad  data 
with  greater  computational  efficiency  than  batch 
adjustments. 

2.  Many  problems  are  better  suited  for  a  sequential 
adjustment. 

3.  A  successful  sequential  blunder  check  provides  the 
blunder's  location. 

4.  Use  of  a  sequential  technique  implies  that  after 
each  iteration  there  is  at  most  one  blunder  ,  the 
most  recently  added  observation. 

5.  One  may  examine  the  effect  of  adding  and  deleting 
the  observations  on  parameter  estimates  and  their 
covariance  matrix. 

6.  Reobservation  may  be  impossible  after  simultaneous 
adjustment.  The  sequential  approach  may  allow  re¬ 
observation  and  thus  eliminate  singularities  in 
the  normal  equation  coefficient  matrix  caused  by 
deleting  blunders  after  a  simultaneous  adjustment. 

7.  A  remeasurement  capability  may  allow  achievement 
of  optimum  levels  for  type  I  and  II  errors. 
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There  are  also  several  challenges  to  approaching 
blunder  detection  with  a  sequential  algorithm.  First,  sequen¬ 
tial  adjustments  require  the  inverse  of  the  normal  equation 
coefficient  matrix.  Inversion  is  not  an  additional  burden 
since  most  statistically  based  blunder  detection  methods  also 
require  the  inverse.  The  sequential  technique  is  actually 
computationally  more  efficient  because  it  combines  the  blun¬ 
der  detection  step  with  the  actual  adjustment  -  One  must 
also  be  aware  that  many  techniques  already  c  eloped  rely  on 
the  central  limit  theorem  to  employ  a  number  parametric 
tests.  Sequential  analysis  would  require  analysis  of  the 
effects  of  sample  size  and  redundancy  on  the  test.  Then, 
there  is  the  fact  that  the  sequential  technique  does  not 
yield  the  true  residual  at  a  given  step  in  the  adjustment, 
but  a  residual  based  on  parameters  estimated  from  observa¬ 
tions  added  to  that  point.  A  new  estimate  of  the  reference 
variance  will  be  required  for  each  step  of  the  sequential 
adjustment.  Finally,  the  assumption  that  only  one  blunder  is 
present  may  be  violated  as  the  test  sensitivity  increases 
when  the  observations  are  added.  In  other  words,  as  the  solu¬ 
tion  begins  to  converge  when  more  observations  are  added,  is 
it  possible  that  previously  undetectable  blunders  could,  in 
fact,  be  detected?  The  answer  to  these  questions  will  be  a 
major  portion  of  this  thesis. 
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1.  2  Thesis  Objective 


The  objective  of  this  thesis  is  to  develop  and  test  an 
algorithm  for  detecting  blunders  in  observation  data  using  a 
sequential  least  squares  adjustment  technique.  The  objective 
will  also  entail  developing  a  computer  program  as  the  means 
of  testing  the  algorithm.  The  algorithm  should  meet  the  fol¬ 
lowing  requirements: 

1.  It  should  detect  blunders  of  small  magnitude  and 
blunders  of  large  magnitude  equally  well. 

2.  It  should  provide  blunder  location  ,  if  possible, 
without  reobservation. 

3.  The  algorithm  should  be  computationally  efficient, 
and  should  not  be  restricted  to  standard  cases. 

4.  It  must  determine  tolerances  and  the  maximum  unde¬ 
tectable  blunder. 

5.  The  algorithm  should  not  assume  a  priori  knowledge 
of  the  blunder's  magnitude,  location,  or  frequency 
of  occurence. 

6.  It  should  incorporate  statistical  measures  of 
confidence  for  types  I  and  II  errors. 


1. 3  Definition  of  a  Blunder 

Before  proceding  with  the  algorithm  development,  it 
would  be  useful  to  define  blunder.  A  blunder  is  an  outlier, 
or  observation  which  appears  to  be  inconsistent  with  the 
remainder  of  the  data  set. (Barnett  and  Lewis,  1978).  Outli¬ 
ers,  though,  are  not  necessarily  bad  or  erroneous  and  should 
not  universally  be  rejected  when  found.  Their  presence  may 
help  explain,  or  at  least  help  identify,  a  true  anomaly  that 
should  actually  be  incorporated  into  the  solution.  This 
requires  a  thorough  knowledge  of  the  model  or  system  under 
study  before  embarking  on  any  technique  that  will  automati¬ 
cally  reject  data  behaving  as  outliers.  There  is  inherent 
variability  in  any  measuring  process.  However,  this  variabl- 
ity,  which  is  the  combination  of  many  small  random  errors  can 
be  modeled  statistically.  Any  data,  then,  that  do  not  conform 
to  the  statistical  model  are  blunders.  These  blunders  may 
arise  from  many  souces  such  as  recording  errors,  data  tran¬ 
smission  errors,  or  point  misidentif ication,  and  their  magni¬ 
tude  may  range  from  very  small  to  very  large.  It  is  useful 
to  divide  blunders  into  two  categories.  First,  there  are 
large  blunders  that  are  detectable  without  having  to  resort 
to  rigorous  statistical  methods  as  suggested  by  El-Hakim. 
(El-Hakim  and  Zeiman,  1984)  Second,  there  are  blunders  of 


2. 


2. 1  Assumptions 


n 


There  are  several  assumptions  required  for  developing 
a  blunder  detection  scheme  based  upon  the  examination  of 
residuals  from  a  least  squares  adjustment.  First,  there  are 
four  assumptions  concerning  the  observational  errors. 

1.  E(e  )  =  0  which  implies  that  E(v  )  =0  as  well.  This 

i  i 

is  demonstrated  in  Appendix  J. 

2.  Var(e  )  =  where  e  is  the  error  of  the  observation 

1  *  i 

of  unit  weight.  T0  will  hereafter  be  refered  to  as  the 

2  — 

reference  variance.  This  implies  that  Var(V)  =  Q, 
where  V  is  the  vector  of  residuals. 


3.  Cov( e  ,e  )  =  0  for  i*j ,  which  means  that  there  is  no 

i  j 

correlation  among  the  observational  errors. 

2 

4.  e  N(0,Yo)  which  means  that  the  individual  residuals 

i 

are  also  distributed  as  a  normal,  V  is  distributed 
as  a  multivariate  normal,  and  parametric  statistical 
tests  may  be  used. 

These  four  assumptions  provide  a  basis  for  using  residuals 
for  blunder  detection  due  to  the  relationship  between  the 
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small  magnitude  which  are  not  readily  obvious  but  can  cause 
the  solution  to  be  invalid  for  its  intended  purpose.  Blunders 
in  this  category  are  very  difficult  to  indentify  without  sta¬ 
tistical  analysis  and  will  be  the  primary  target  of  the 
algorithm  to  be  developed.  It  follows  that  any  technique  for 
detecting  blunders  of  the  second  category  would  certainly 
work  on  large  blunders  overlooked  by  a  non- statistical 
method.  A  blunder,  then,  is  an  observation  that  does  not 
behave  statistically  as  one  would  expect  regardless  of  the 
cause.  The  question  of  what  to  do  with  blunders,  once  found, 
will  be  discussed  later  in  this  paper. 

1. 4  Thesis  Organization 

The  remainder  of  this  paper  is  organized  to  accomplish 
the  thesis  objective.  Chapter  2  will  propose  an  algorithm  for 
blunder  detection  using  a  sequential  least  squares  adjustment 
procedure.  Chapter  3  will  present  an  evaluation  of  the  algor¬ 
ithm  and  Chapter  4  will  provide  the  results  of  how  the  algor¬ 
ithm  performed  on  a  simple  data  set.  Finally,  Chapter  5  will 
summarize  how  well  the  algorithm  satisfies  the  requirements 
in  Section  1.  2  and  offers  questions  requiring  further  study. 
The  theoretical  basis  for  the  algorithm  is  developed  in  the 
Appendices  and  throughout  the  text;  equations  will  be  intro¬ 
duced  that  are  fully  developed  in  the  appendices.  Also, 
Appendix  A  contains  a  list  of  the  symbols  used  in  the  least 
squares  model  presented  in  this  thesis. 
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residuals  and  observational  errors  developed  in  Appendix  J. 
There  are  additional  assumptions  required  which  concern  the 
observations.  These  assumptions  will  be  necessary  when  a 
sequential  adjustment  method  will  be  used. 

1.  The  weight  matrix  of  the  observations  is  block 
diagonal.  This  implies  that  individual  observations 
are  independent,  or  at  least  the  observations  may  be 
divided  into  small  independent  blocks.  Observations 
can  then  be  accepted  or  rejected  independently  with¬ 
out  any  inference  concerning  other  observations. 

2.  The  total  number  of  blunders  in  the  observation  data 
is  small  and  the  redundancy  exceeds  ,  by  far  ,  the 
number  of  blunders.  If  this  is  not  the  case,  the  set 
of  observations  reflects  problems  of  a  different 
nature  such  as  data  transmission  or  modeling  errors. 

3.  The  observations,  less  any  blunders,  should  form  a 
well  conditioned  algebraic  system.  Furthermore,  at 
least  (u  +  rl)  observations  must  agree  with  assumed 
stochastic  properties  and  form  the  well  conditioned 
algebraic  system.  The  variables  u  and  rl  are  the 
number  of  parameters  and  initial  redundancy.  There 
may  be  no  systemmatic  errors  and  there  must  be  a 
correct  initial  solution  from  which  the  sequential 
adjustment  may  proceed. 

4.  The  observations  to  be  added  'p'  at  a  time  during  a 


sequential  adjustment  can  be  added  in  a  manner  such 
that  the  rank  of  the  cofactor  matrix  for  residuals 
corresponding  to  the  '  p'  observations  is  p.  This  is 
important  because  the  algorithm  to  be  developed  will 

-It 

require  that  Q  and  [Q+BN  B]be  inverted. 

22  2212 

All  of  these  assumptions,  including  the  assumption  of  normal¬ 
ity  of  the  observational  errors,  are  reasonable  for  most  mea¬ 
suring  systems.  Anyone  applying  a  blunder  detection  algorithm 
during  an  adjustment  should  have  studied  the  system  suffi¬ 
ciently  to  verify  the  validity  of  these  assumptions. 

2. 2  Key  Relationships  in  Sequential  Adjustments 

A  least  squares  solution  from  observation  equations 
being  added  sequentially  will  produce  results  identical  to 
those  of  other  approaches.  However,  the  observation  equation 
method  lends  itself  to  very  simple  partitioning  and  permits 
working  with  the  observations  directly  and  individually. 
Appendices  B  and  C  show  how  a  set  of  observation  equations 
can  be  partitioned  into  a  base  group  and  a  test 
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group.  The  resulting  residuals  are  then 
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These  residuals  can  be  estimated  using  only  the  observations 
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and  V  are  functions  of  only  the  observations  in  the  base 
1  2 

group  while  the  V  and  V  are  functions  of  the  observations 

1  2 

A  A 

in  both  the  base  and  test  groups.  V  and  V  ,  though,  may  be 

1  2 

expressed  as  functions  of  V  and  V  by 

1  2 


A 

V 


A 

V 
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=  V  -  Q  Q  V 
1  12  22  2 

_-l 

=  Q  Q  V 
2  22  2 
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These  two  equations  are  very  useful  because  they  permit  the 
total  sum  of  weighted,  squared  residuals  to  be  computed  for 
each  step  of  the  sequential  adjustment  as  the  sum  of  the 
effects  of  the  base  group  and  the  effects  of  the  test  group. 
From  Appendix  G, 

t  a  t  a  b  _ -  1 

VWV  =  V  WV+V  QV  2.  5 

1  1  1  2  22  2 

This  is  a  key  relationship  because  the  residual  esti¬ 
mates  corresponding  to  the  observations  in  the  base  group 
are  actually  the  residuals  for  the  same  observations  prior  to 
adding  the  '  p'  observations  of  the  test  group  sequentially. 
The  total  sum  of  weighted,  squared  residuals  can  be  computed 
for  each  step  of  the  sequential  adjustment  and  carried  for¬ 
ward  for  the  next  group  of  ’  p'  observations.  Appendix  H 
explains  also  that  the  two  quadratic  forms  on  the  right  side 
of  equation  2.  5  are  independent  and  distributed  as 
CHI-SQUARE,  which  follows  the  assumption  that  the  observa¬ 
tional  errors  and,  hence,  the  residuals  are  normaly  distri¬ 
buted.  These  two  distributions  are 

V  t  W  V  CHI-SQUARE(rl)  xY,1  2.6 

111 

t  _-l  i 

V  Q  V  — '  CHI-SQUARE(p)  xY(  2.7 

2  22  2 

where  rl  and  p  are  the  degrees  of  freedom  of  the  two  CHI- 


The  two  quadratic  forms  will  be  the  basis  for  a 
blunder  detection  algorithm.  However,  it  is  first  necessary 
to  determine  whether  an  a  priori  or  a  posteriori  reference 
variance  should  be  used. 

2.  3  A  Priori  Versus  A  Posteriori  Reference  Variance 

There  are  two  distinct  approaches  to  blunder  detection 
that  depend  on  whether  or  not  an  a  priori  reference  variance 
is  available.  Stefanovic  is  an  advocate  of  using  an  a  priori 
variance  (Stefanovic,  1981).  He  assumes  that  the  mathematical 
models  and  measurements  systems  are  sufficiently  tested  in 
advance  to  eliminate  systemmatic  errors  and  determine  an  a 
priori  reference  variance.  He  also  points  out  that  the  a  pos¬ 
teriori  variance  can  always  be  tested  against  the  a  priori 
variance  using  a  simple  F-test.  He  finds  that  the  use  of  an  a 
posteriori  variance  in  applications  such  as  Pope's  Tau  dis¬ 
tribution  (Pope,  1976)  has  a  serious  disadvantage.  A  blunder 
among  the  observations  inflates  the  estimated  variance  and 
makes  any  statistical  test  incorporating  an  a  posteriori 
estimate  less  likely  to  reject  blunders  of  a  magnitude  small 
enough  to  have  been  overlooked  by  nonstatistical  methods.  His 
dual  CHI-SQUARE  test  (Stefanovic,  1981)  examines  both  the 
base  group  and  the  test  group  using  equations  2. 6  and  2.  7. 


Pope  (Pope  1976),  however,  argues  that  in  reality,  the  actual 
variance  and  the  a  priori  variance  differ  significantly. 
Often,  the  a  priori  variance  is  too  small  and  causes  too  many 
observations  to  be  rejected.  The  ideal  algorithm  for  blunder 
detection,  then,  should  not  require  either  reference  vari¬ 
ance,  and  thus  eliminate  the  apparent  weaknesses  of  both  of 
these  approaches. 

2. 4  The  Proposed  Test 

One  possible  test  for  blunders  that  does  not  require 
incorporating  either  an  a  pTiori  or  a  posteriori  reference 
variance  is  an  F-test.  Because  the  two  quadratic  forms 
(equations  2.6  and  2.7)  are  independently  distributed  as 
CHI-SQUARE,  by  definition  the  test  statistic 

t  _-l 

[(  V  Q  V  )/p] 

2  22  2 

T.  S.  =  _  2.8 

/*>  t  /\ 

[(V  W  V  )/rl] 
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is  distributed  as  an  F  with  p  and  rl  degrees  of  freedom.  The 
test  statistic  is  compared  to  a  critical  value  defined  by  the 
F  statistic  with  p  and  rl  degrees  of  freedom  at  a  given  level 


of  signif icance.  The  'p  observations  of  the  test  group  are 
rejected  if  the  test  statistic  exceeds  the  F  statistic  at  the 
required  level  of  significance.  The  test  simply  determines 
whether  the  observations  in  the  test  group  have  more  vari¬ 
ability  than  those  of  the  base  group.  Rejection  occurs  when 
the  test  group  does  demonstrate  statistically  greater  vari¬ 
ability  than  the  base  group.  This  test  has  validity  when  the 
observational  errors  and  corresponding  residuals  have 
expected  values  of  zero.  The  test  is  also  independent  of  any 
a  priori  variance,  and  simply  requires  that  consistent  accu¬ 
racy  and  precision  in  the  observations  be  maintained.  The 
statistical  test  should  proceed  as  follows: 

Ho  :  The  observations  of  the  test  group  have  a  variance 
less  than  or  equal  to  that  of  the  base  group.  Or, 


^r 

test 


T 

base 


Ha  :  The  observations  of  the  test  group  have  a  variance 
greater  than  that  of  the  base  group.  Or, 


^r 

test 


"T 

base 


Reject  Ho  if  T.  S.  >  F(®<,p,rl). 

Even  though  this  test  does  not  rely  on  an  a  priori 


variance,  it  does  require  that  the  initial  solution  be  sta- 


mentioned  now  because  the  sequential  solution  must  build  upon 
an  initial  solution.  If  there  is  a  requirement  to  satisfy  an 
a  priori  reference  variance,  this  test  still  permits  the  a 
posteriori  estimate  to  be  tested  using  an  F-test.  The  advan¬ 
tage,  though,  is  that  the  proposed  sequential  technique  does 
not  allow  the  statistically  detectable  blunders  to  inflate 
the  estimate  of  the  variance.  The  proposed  test  provides  a 
truer  picture  of  the  variability  in  the  measurement  system 
and  a  better  comparison  of  the  a  priori  and  a  posteriori 
reference  variances  when  such  a  comparison  is  required. 


3.  1  Advantages  of  A  Sequential  Algorithm 

The  proposed  algorithm  has  a  number  of  strengths  or 
advantages  when  compared  to  many  simultaneous  adjustment 
blunder  detection  methods.  First,  it  has  a  sound  theoretical 
basis.  Unlike  many  techniques,  the  F-test  of  this  algorithm 
does  not  rely  on  an  arbitrary  3-sigma  cutoff  or  similar  arti¬ 
ficial  critical  value.  It  also  does  not  rely  on  a  restrictive 
assumption  that  the  data  can  contain  at  most  one  blunder  as 
do  methods  such  as  rejecting  the  observation  with  the  largest 
residual  or  Baarda's  data  snooping.  Another  theoretical 
advantage  is  that  this  test  does  not  rely  on  either  an  a 
priori  or  a  posteriori  variance.  The  sequential  method 
rejects  blunders  before  they  can  contaminate  the  estimate  of 
the  reference  variance.  The  standard  F-test  is,  then,  a  bet¬ 
ter  indication  of  how  the  actual  measurement  system  is  behav¬ 
ing  relative  to  an  a  priori  variance  if  such  a  comparison  is 
required. 

Aside  from  these  theoretical  advantages  over  tradi¬ 
tional  approaches  to  blunder  detection,  this  algorithm  also 
has  computational  and  practical  advantages  over  those  simul- 


taneous  adjustment  methods  that  are  theoretically  sound.  The 
sequential  adjustment  offers  much  greater  computational  effi¬ 
ciency  in  several  ways.  In  cases  where  more  than  one  blunder 
may  be  present,  blunder  location  following  a  batch  adjustment 
requires  a  brute  search,  while  the  sequential  method  provides 
location  at  the  same  time  that  a  blunder  is  detected.  Also, 
even  though  sequential  adjustments  require  a  large  number  of 
multiplications,  combining  the  actual  adjustment  and  blunder 
detection  algorithm  into  one  step  offers  much  greater  compu¬ 
tational  efficiency.  Data  rejection  for  a  sequential  adjust¬ 
ment  simply  requires  deletion  of  a  rejected  observation  which 
is  much  faster  than  obtaining  an  new  simultaneous  solution. 

The  simultaneous  adjustment  and  blunder  detection 
technique  is  also  very  practical.  There  are  many  situations 
where  data  is  added  to  an  adjustment  after  the  adjustment  has 
already  been  completed  and  old  observations  discarded.  The 
sequential  algorithm  permits  the  new  data  to  be  examined  and 
rejected  if  required.  This  procedure  may  prevent  bad  data 
acquired  later  from  ever  contaminating  a  good  data  set.  One 
may  also  examine  the  effect  of  the  observations  on  the  param¬ 
eter  estimates  and  the  covariance  matrix  of  these  estimates. 
Then,  there  is  the  possibility  for  remeasurement  to  avoid 
losing  a  data  point.  Finally,  the  sequential  adjustment  tech¬ 
nique  can  be  run  on  small  computers  as  well  as  large,  and  may 


provide  a  blunder  detection  capability  for  microcomputers 
that  was  previously  unavailable. 

3. 2  Limitations  of  the  Algorithm 

The  proposed  algorithm  has  two  limitations.  The  first 
of  these  is  that  the  residuals  must  be  distributed  as  a  mul¬ 
tivariate  normal  distribution.  If  this  is  not  the  case  then 
the  F-test  is  not  valid  because  the  two  CHI-SQUARE  distribu¬ 
tions  forming  the  test  statistic  are  not,  in  fact,  CHI-SQUARE 
distributions.  The  normality  assumption  is  not  a  particularly 
serious  limitation  since  there  are  many  real  world  problems 
about  which  this  assumption  is  valid.  However,  one  should 
exercise  caution  when  working  with  an  untested  system.  The 
second  limitation  requires  more  attention.  A  sequential 
adjustment  relys  on  an  initial  solution.  This  initial  solu¬ 
tion  is  the  foundation  upon  which  the  entire  system  will  be 
adjusted  and  the  final  results  and  quality  of  the  blunder 
detection  performance  judged.  A  good  initial  solution 
requires  enough  correct  data  and  initial  redundancy  to  detect 
small  blunders  early  in  the  sequential  adjustment  and  prevent 
a  case  where  acceptable  observations  become  unacceptable  as 
the  estimates  of  the  parameters  converge.  This  requirement 
for  a  good  initial  solution  requires  more  discussion. 
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3. 3  The  Initial  Solution 

It  is  important  that  the  definition  of  an  initial 
solution  be  clearly  understood.  There  are  two  approaches  to 
obtaining  an  initial  solution  for  a  sequential  adjustment. 
One  method  involves  performing  a  simultaneous  adjustment  on 
the  first  (u+rl)  observations.  These  observations  must  be 
selected  so  that  the  system  is  solvable.  The  rank  of  the  nor¬ 
mal  equations  coefficient  matrix  must  be  at  least  u.  The 
other  begins  using  estimates  of  the  parameters  and  and  the 
sequential  adjustment  procedure  with  the  initial  observation. 
The  key  point  is  that  from  the  point  where  the  number  of 
observations  equals  the  number  of  parameters,  the  two  solu¬ 
tions  are  identical.  When  (u+rl)  observations  have  been 
added,  both  approaches  have  the  same  solution.  However,  ini¬ 
tial  solution  for  blunder  detection  is  not  merely  the  start¬ 
ing  point  for  the  sequential  adjustment.  For  blunder  detec¬ 
tion,  initial  solution  refers  specifically  to  the  solution 
upon  which  the  blunder  detection  algorithm  employed  in  the 
sequential  adjustment  will  be  statistically  meaningful. 
Either  method  may  be  used  as  long  as  the  first  (u+rl)  obser¬ 
vations  are  blunder  free  and  provide  sufficient  redundancy  to 
make  the  F-test  meaningful  for  the  observations  to  be  added 
early  in  the  sequential  adjustment. 
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One  critical  task  of  the  sequential  adjustment  is  to 
obtain  the  (u+rl)  blunder  free  observations  for  the  initial 
solution.  If  blunders  are  present  in  the  initial  solution, 
the  entire  sequential  adjustment  is  invalid  because  it  pro¬ 
ceeds  from  an  incorrect  starting  point.  The  F-test  of  the 
algorithm  is  invalid  also  because  the  quadratic  form  in  the 
denominator  becomes  artificially  inflated,  permitting  even 
more  blunders  to  be  incorporated  into  the  solution  as  the 
adjustment  proceeds.  The  problem  obviously  requires  that 
these  observations  be  taken  with  particular  care  but  does  not 
pose  an  insurmountable  obstacle  for  the  sequential  algorithm 
to  be  implimented.  One  obvious  measurement  technique  would 
be  to  make  multiple  measurements  for  the  observations  of  the 
initial  solution.  As  Stefanovic  states,  multiple  measurements 
do  not  increase  the  precision  of  the  measurement,  but  they  do 
help  identify  blunders.  (Stefanovic,  1981)  In  this  manner, 
the  initial  observations  can  be  '  checked  out  '  and  obvious 
blunders  eliminated  using  nonstatistical  methods  similar  to 
those  of  El  Hakim  (El  Hakim  &  Forstner,  1984)  All  remaining 
observations  need  only  be  measured  once  because  the  algorithm 
will  detect  the  blunders  in  this  group.  The  practice  of  tak¬ 
ing  multiple  measurements  is  also  directly  related  to  the 
concept  of  weights.  The  initial  solution  should  always  con¬ 
tain  the  observations  with  the  highest  possible  weight.  The 


weight  of  the  observation  reflects  certainty,  whether  through 
multiple  measurements  or  through  more  precisely  made  measure¬ 
ments  such  as  first-order  versus  second-order  survey  coordi¬ 
nates.  Such  an  approach  is  very  logical  but  must  sometimes 
be  violated.  One  must  always  insure  that  the  initial  observa¬ 
tions  form  a  well  conditioned  algebraic  system  and  there 
should  be  no  singularities  in  the  normal  equation  coefficient 
matrix.  The  sequential  adjustment  can  be  simulated  to  iden¬ 
tify  these  restrictions  and  the  initial  observations  made 
accordingly.  Finally,  one  may  use  the  simultaneous  adjustment 
blunder  detection  algorithms,  such  as  those  proposed  by  Pope 
and  Stefanovic,  to  search  for  blunders  in  the  initial  solu¬ 
tion.  The  initial  solution,  though,  requires  fewer  brute 
searches  for  the  blunders  than  would  a  simultaneous  adjust¬ 
ment  containing  all  of  the  observations.  The  sequential 
algorithm  does  not  require  any  more  care  in  measuring  than  do 
the  simultaneous  methods.  After  the  initial  solution  is 
obtained,  there  are  fewer  measurements  required  for  the 
sequential  adjustment  because  it  is  unnecessary  to  make  the 
multiple  measurements  for  the  remaining  observations.  There¬ 
fore,  the  apparent  limitation  of  requiring  good  data  for  the 
initial  solution  can  be  overcome  through  proper  data  measure¬ 
ment  and  point  selection  methods. 


The  other  key  task  in  the  sequential  adjustment  is  to 
select  the  proper  initial  redundancy.  This  task  is  important 
for  two  reasons.  First,  the  F-test  should  work  as  well  for 
data  added  early  in  the  adjustment  as  for  data  added  later 
when  the  redundancy  is  higher.  If  the  initial  redundancy  is 
too  low,  then  the  F-test  will  be  too  insensitive.  This  may 
mean  that  observations  accepted  early  in  the  sequence  might 
have  been  rejected  if  added  to  the  soultion  later  when  the 
redundancy  was  higher.  The  test  may  be  particularly  insensi¬ 
tive  when  the  first  few  observations  are  added  and  the  number 
of  observations  being  added  sequentially  (p)  is  equal  to  one. 
The  second  reason,  related  to  the  first, is  that  if  the  ini¬ 
tial  test  is  too  insensitive,  blunders  may  still  be  allowed 
into  the  solution  before  enough  redundancy  is  achieved  to 
make  them  statistically  detectable. 

There  are  at  least  two  means  of  determining  if  the 
initial  redundancy  is  sufficient.  One  method  is  to  check  if 
the  observations  being  added  after  the  initial  solution  pro¬ 
duce  the  same  results  when  added  in  reverse  order.  If  the 
results  are  the  same,  then  the  test  performed  well  for  obser¬ 
vations  added  early  and  late  in  the  adjustment.  If  the 
results  are  not  the  same,  then  the  initial  redundancy  was  not 
sufficient  to  insure  that  the  observations  added  early  were 
properly  tested.  In  this  case,  one  should  increase  the  ini- 
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tial  redundancy.  Adding  the  data  in  reverse  order  is  a  simple 
empirical  test  that  can  be  run  on  different  sets  of  data  to 
establish  the  desired  initial  redundancy  for  a  given  measure¬ 
ment  system.  Often,  examination  of  the  F-tables  for  the 
required  level  of  significance  and  p  will  give  a  good  idea  of 
what  initial  redundancy  is  required  before  the  F-statistic  is 
small  enough  to  provide  a  meaningful  test.  The  selection  may 
be  subjective  or  based  upon  previous  experience  with  a  given 
measurement  system. 

An  alternate  approach  would  be  to  compute  the  maximum 
undetectable  blunder.  (Stefanovic,  1981)  The  maximum  unde¬ 
tectable  blunder  could  be  computed  for  the  initial  solution 
and  the  final  solution  for  comparison.  If  they  are  nearly 
equal,  then  the  initial  solution  contained  enough  initial 
redundancy.  Nearly  equal  can  be  either  subjective,  or  the 
result  of  propagating  the  difference  between  the  initial 
solution  maximum  undetectable  blunder  and  the  final  solution 
maximum  undetectable  blunder  through  the  solution  to  deter¬ 
mine  if  the  there  is  a  noticeable  adverse  effect.  This  is 
also  an  empirical  test  and  will  probably  differ  with  diffe¬ 
rent  measurement  systems.  Finally,  the  level  of  significance 
also  impacts  the  way  the  test  treats  data  added  early  and 
will  be  discussed  in  the  next  section. 
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3.4  Type  I  and  Type  II  statistical  Errors 

The  selection  of  the  level  of  significance  for  the 
F-test  in  a  sequential  adjustment  is  very  important  because 
it  is  also  the  probability  of  a  Type  I  error.  The  Type  I 
error  occurs  when  one  rejects  the  null  hypothesis  when  the 
null  hypothesis  is  actually  true.  A  type  II  error  occurs  when 
one  fails  to  reject  a  false  null  hypothesis.  For  the  statis¬ 
tical  test  outlined  in  Section  2.4,  the  Type  I  error  results 
in  rejecting  a  good  data  point  while  the  Type  II  results  in 
allowing  a  blunder  into  the  solution.  The  costs  associated 
with  the  Type  I  error  include  that  of  remeasurement  or 
requiring  a  more  sophisticated  matrix  inversion  method  due  to 
weakened  geometry.  Type  II  errors  mean  that  the  adjustment 
contains  blunders  and  is  invalid  for  its  intended  purpose. 
Type  I  and  Type  II  errors  have  an  inverse  relationship,  mean¬ 
ing  that  decreasing  the  level  of  significance  increases  the 
probability  of  a  Type  II  error.  In  order  to  insure  that  the 
probability  of  a  Type  II  error  is  as  small  as  possible,  the 
level  of  significance  should  be  as  large  as  possible.  A  lar¬ 
ger  level  of  significance  will  require  that  more  data  be 
rejected  and  will  require  lower  initial  redundancy  for  the 
F-statistic  to  become  small  enough  to  have  statistical  mean- 


There  have  been  many  articles  concerning  the  optimum 
levels  of  Type  I  and  Type  II  errors.  Molennaar  provides  a 
summary  of  the  subject  (Molenaar,  1981).  However,  there  are 
some  comments  that  should  be  made  concerning  these  errors  and 
a  sequential  blunder  detection  technique.  First,  one  must 
assume  that  the  goal  is  to  minimize  costs.  Therefore,  any 
decision  made  concerning  the  optimum  level  of  significance 
should  result  in  the  least  expensive  result.  Type  II  errors 
usually  have  the  most  expensive  consequences  because  blunders 
adversely  effect  the  final  solution.  Type  I  errors,  though, 
may  be  overcome  in  several  ways.  First,  the  sequential 
adjustment  permits  the  observation  to  be  remeasured  and  added 
later.  On  line  measuring  systems  can  then  employ  large  signi¬ 
ficance  levels  because  the  costs  of  remeasurement  are  low.  A 
high  significance  level  is  also  desired  when  adding  data  to 
an  old  solution.  Too  many  good  data  sets  have  been  contami¬ 
nated  after  the  fact  because  observations  were  added  later 
that  did  not  conform  to  the  original  data.  The  high  level  of 
significance  serves  as  a  tighter  filter.  Also,  the  sequential 
adjustment  method  can  be  simulated,  and  contingencies  made 
for  data  rejection.  Thus,  measurements  in  excess  of  those 
required  can  be  made,  and  a  higher  level  of  significance  used 
without  having  to  obtain  additional  measurements  later. 
Finally,  the  data  does  not  always  have  to  be  rejected.  The 
weights  can  be  altered,  but  this  can  be  expensive  as  well. 


A  key  point  concerning  Type  I  and  Type  II  errors  is 
that  the  sequential  adjustment  provides  far  greater  flexibil¬ 
ity  for  dealing  with  Type  I  errors  than  does  the  simultaneous 
adjustment.  Such  flexibility  with  Type  I  errors,  in  turn, 
provides  better  opportunities  for  preventing  far  more  costly 
Type  II  errors.  Again,  each  measurement  system  should  be 
empirically  tested  to  resolve  which  level  of  significance  is 
optimal  for  the  adjustment. 


4. 1  Digital  Sequential  Adjustment  Blunder  Detection 

Blunder  detection  algorithms  are  most  practical  when 
they  can  be  efficiently  incorporated  into  digital  computer 
adjustment  programs.  The  sequential  algorithm  is  particu¬ 
larly  easy  to  program  because  key  quantities  for  the  F-test 
are  computed  during  the  actual  adjustment.  Appendix  F  demons¬ 
trates  that  a  key  portion  of  the  Kalman  gain  matrix  can  be 
used  to  compute  the  numerator  of  the  test  statistic.  Sequen¬ 
tial  reduction,  after  data  rejection,  is  also  very  simple, 
requiring  only  several  sign  changes  in  the  input  data.  A 
sequence  of  critical  steps  for  a  computer  program  for  a 
sequential  adjustment  with  the  F-test  for  blunder  detection 
is  in  Appendix  K.  The  actual  program  used  to  test  the  algor¬ 
ithm  is  written  in  Fortran  77  for  an  IBM  -  370  computer,  and 
the  program  listing  is  provided  in  Appendix  K. 


4. 2  The  Data  to  be  Tested 

The  purpose  of  this  test  is  to  illustrate  the  flexi¬ 
bility  of  the  algorithm,  its  programmability,  and  the  types 
of  questions  that  must  be  answered  when  applying  the  algor¬ 
ithm  to  a  specific  measuring  system.  A  straight  line  is  the 
simplest  of  models,  is  easily  understood,  and  does  not 
require  linearization.  There  are  two  methods  of  obtaining 
test  data  for  a  straight  line.  The  first  method  consists  of 
generating  a  perfect  data  set  and  then  introducing  random 
error  and  blunders.  The  second  method  requires  actual  mea¬ 
surements  on  an  actual  straight  line  to  be  made  and  the  mea¬ 
surements  become  the  observation  data.  Either  method  will 
provide  a  test  data  set,  however,  the  second  method  is  used 
for  three  reasons.  First,  the  requirement  for  making  measure¬ 
ments  is  more  realistic  and  does  not  provide  a  priori  know¬ 
ledge  of  a  blunder's  existence  or  location.  Second,  the  con¬ 
cept  of  multiple  measurements  for  the  observations  to  be 
incorporated  into  the  initial  solution  can  be  employed. 
Finally,  the  use  of  a  perfect  data  set  with  random  errors  and 
blunders  introduced  may  not  reflect  the  variability  of  an 
actual  measuring  system  and  can  thus  result  in  an  incomplete 


understanding  of  how  the  algorithm  is  performing.  Any  test 
conducted  on  a  set  of  actual  measurements  will  simply  iden¬ 
tify  those  data  points  that  are  statistically  inconsistent 
with  the  remaining  observations  without  any  a  priori  know¬ 
ledge  of  blunders. 

Data  set  one  consists  of  thirty  points  that  lie  along 
a  straight  line.  Tables  4. 1  and  4. 2  contain  the  data  sets 
with  initial  redundancy  of  3  and  4  respectively.  The  first 
data  points  for  each  set  have  a  weight  of  five  because  these 
measurements  were  made  five  times.  These  are  the  points  that 
will  be  used  in  the  initial  solution.  All  other  points  were 
measured  once  and  have  a  weight  of  unity.  At  the  top  of  each 
data  set  are  the  total  number  of  observations,  number  of 
model  parameters,  the  initial  redundancy,  and  number  of 
observations  to  be  added  sequentially.  The  data  can  be  added 
sequentially  in  any  order  desired.  The  table  entries  are  the 
observation,  the  coefficients  of  the  model,  the  weight  of  the 
observation,  and  the  model  constant  (D(i))  which  is  zero  for 
this  case.  For  the  straight  line,  the  model  is 

Y  =  A  X  +  B 

where  A  is  the  slope  parameter, B  is  the  intercept  parameter, 
Y  is  the  Y-coordinate  which  is  measured,  and  X  is  the 
X-coordinate  which  is  assumed  to  be  error  free.  The 
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results  will  be  same  as  those  for  an  adjustment  where  both  X 
and  Y  are  treated  as  observations.  The  choice  of  initial 
redundancy  for  data  set  one  is  subjective.  The  F-statistic 
does  not  appear  small  enough  for  observations  added  sequen¬ 
tially  one  at  a  time  until  the  initial  solution  has  a  redun¬ 
dancy  of  three.  The  object  is  to  select  an  initial  redundancy 
high  enough  to  enable  the  algorithm  to  detect  blunders  early 
in  the  solution.  The  straight  line  has  two  parameters  so 
that  an  initial  redundancy  of  three  requires  five  observa¬ 
tions  for  the  initial  solution.  The  abscissa  range  is  0  to 
7.25  inches,  and  the  ordinate  range  is  0  to  5.05  inches.  Mea¬ 
surements  were  made  with  an  engineer’s  scale  (60  scale)  and 
converted  to  decimal  inches.  The  reference  variance  is  not 
known  a  priori,  but  this  is  not  important  because  the  algor¬ 
ithm  is  not  based  upon  an  a  priori  reference  variance.  The 
concept  of  multiple  measurements  was  used  to  select  data  for 
an  initial  solution.  Finally,  the  points  measured  for  the 
initial  solution  are  spread  over  the  entire  length  to  insure 
that  there  are  no  weaknesses  in  the  geometry  as  observations 
are  added  sequentially.  Data  set  two  is  the  same  as  data  set 
one  except  the  point  for  X=6.00  is  measured  four  more  times 
and  added  to  the  initial  solution.  Data  set  two,  then,  has  an 
initial  redundancy  of  4. 
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Table  4.  2  -  Data  Set  2,  Initial  Redundancy  = 


n  =  30 


u  =  2 


rl  =  3 


P  = 


Y 

X 

INT 

w 

D 

0.  672 

-1.  00 

-1.  0 

5 

0.  0 

1.  353 

-2.  00 

-1.  0 

5 

0.  0 

2.  023 

-3.  00 

-1.  0 

5 

0.  0 

3.  375 

-5.  00 

-1.  0 

5 

0.  0 

4.  713 

-7.  00 

-1.  0 

5 

0.  0 

4.  041 

-6.  00 

-1.  0 

5 

0.  0 

0.  167 

-0.  25 

-1.  0 

1 

0.  0 

0.  333 

-0.  50 

-1.  0 

1 

0.  0 

0.492 

-0.  75 

-1.  0 

1 

0.  0 

0.  842 

-1.  25 

-1.  0 

1 

0.  0 

1.  008 

-1.  50 

-1.0 

1 

0.  0 

1.  175 

-1.  75 

-1.  0 

1 

0.  0 

1.  508 

-2.  25 

-1.  0 

1 

0.  0 

1.  683 

-2.  50 

-1.  o 

1 

0.  0 

1.  850 

-2.  75 

-1.  0 

1 

0.  0 

2.  175 

-3.  25 

-1.  0 

1 

0.  0 

2.  342 

-3.  50 

-1.  0 

1 

0.  0 

2.  525 

-3.  75 

-1.  0 

1 

0.  0 

2.  683 

-4.  00 

-1.  0 

1 

0.  0 

2.  842 

-4.  25 

-1.  0 

1 

0.  0 

3.  025 

-4.  50 

-1.  0 

1 

0.  0 

3.  192 

-4.  75 

-1.  0 

1 

0.  0 

3.  533 

-5.  25 

-1.  0 

1 

0.  0 

3.  683 

-5.  50 

-1.  0 

1 

0.  0 

3.  867 

-5.  75 

-1.  0 

1 

0.  0 

4.  200 

-6.  25 

-1.  0 

1 

0.  0 

4.  367 

-6.  50 

-1.  0 

1 

0.  0 

4.  550 

-6.  75 

-1.  0 

1 

0.  0 

4.  883 

-7.  25 

-1.  0 

1 

0.  0 

5.  050 

-7.  50 

-1.  0 

1 

0.  0 

Note:  Observation  25  was  remeasured  4 
more  times  and  added  to  the  initial 
solution  data  to  change  Data  Set  1 
into  Data  Set  2. 


FIGURE  4.1 


TEST  DATA  PLOT 


Experiment  Design 


4.  3 


Six  tests  are  required  to  illustrate  how  the  blunder  detec¬ 
tion  algorithm  works.  First,  three  tests  are  run  on  data  set 
one  to  determine  the  effect  of  altering  the  level  of  signifi¬ 
cance  while  initial  redundancy  and  the  order  in  which  the 
data  is  added  sequentially  are  kept  constant.  Tests  1,2,  and 
3  should  show  that  more  data  is  rejected  as  the  level  of  sig¬ 
nificance  becomes  larger.  Then,  an  additional  test  should  be 
run  on  data  set  one  at  the  same  level  of  significance  as  Test 
3,  but  with  the  observations  being  added  sequentially  in 
reverse  order.  Test  4  should  show  that  if  the  initial  solu¬ 
tion  does  not  contain  enough  redundancy,  or  is  otherwise 
incorrect,  the  same  data  points  will  not  be  rejected  when  the 
order  in  which  data  is  added  is  changed.  The  last  two  tests 
are  run  on  data  set  two  in  the  same  manner  as  Tests  3  and  4 
are  run  on  data  set  one.  However,  data  set  two  has  a  greater 
initial  redundancy.  Tests  5  and  6  may  show  that  the  same  data 
points  are  rejected  if  the  order  in  which  the  data  is  added 
is  reversed.  The  increase  in  initial  redundancy  strengthens 
the  initial  solution  and  enables  it  to  detect  blunders  early 
in  the  adjustment.  Finally,  the  same  data  points  should  con¬ 
sistently  be  rejected  for  this  group  of  tests.  A  lack  of  com¬ 
monality  of  data  rejected  may  indicate  that  the  algorithm  is 
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not  performing  as  it  should.  Table  4. 3  lists  each  test  and 
its  purpose.  Test  results  should  be  considered  by  comparing 
the  results  of  test  groupings.  Tests  1,2,  and  3  are  the  group 
that  illustrate  the  effects  of  varying  level  of  significance. 
Tests  3  and  4  should  show  that  the  order  in  which  data  is 
added  can  effect  the  results  when  the  initial  redundancy  is 
too  low.  And,  tests  5  and  6  should  be  compared  with  the 
results  of  tests  3  and  4  to  show  that  an  increase  in  initial 
redundancy  may  correct  the  problem  and  insure  that  order  has 
no  effect  on  which  data  points  are  rejected. 


4.4  Test  Results 

The  six  tests  demonstrate  the  use  of  the  sequential  adjust¬ 
ment  algorithm  proposed  in  this  paper.  Tables  4.4  and  4.5 
summarize  the  results.  Table  4.4  shows  the  number  of  obser¬ 
vations  rejected  for  each  test  as  the  level  of  significance, 
initial  redundancy,  or  order  in  which  data  are  added  sequen¬ 
tially  are  varied.  Tests  1,2,  and  3,  conducted  on  data  set 
one,  demonstrate  low  a  higher  level  of  significance  causes 

the  algorithm  to  reject  more  data.  Tests  3  and  4  show  that 
I 

when  level  of  significance  and  initial  redundancy  are  cons¬ 
tant,  the  order  in  which  observations  are  added  sequentially 
can  alter  which  data  points  are  rejected.  This  is  an  indica- 

i 


tion  that  the  initial  redundancy  may  be  too  low.  Tests  5  and 
6  reinforce  this  conclusion  because  by  adding  an  extra  point 
of  high  weight  to  the  initial  solution,  the  same  observations 
are  rejected  regardless  of  whether  the  data  is  added  in  the 
listed  order  or  reverse  order.  Tests  1  and  4  rejected  no 
observations. 

Table  4.5  shows  which  points  were  rejected  by  each 
test.  As  expected,  tests  2,3,5,  and  6  rejected  common  points. 
The  observations  corresponding  to  X=4. 25  and  X=5. 50  were 
rejected  each  time  and  were  even  borderline  points  for  test 
4.  This  commonality  is  important  because  it  indicates  that 
these  two  points  really  are  blunders  and  that  the  algorithm 
is  performing  as  it  should.  Test  2  shows  five  points 
rejected.  This  larger  number  of  points  rejected  is  also 
expected  because  test  2  had  the  highest  level  of  signifi¬ 
cance. 

These  six  tests,  run  on  data  sampled  from  a  straight 
line,  are  representative  of  those  which  would  be  required  for 
any  measurement  system  if  the  proposed  blunder  detection 
technique  is  to  be  used.  The  tests  illustrate  the  need  to 
carefully  plan  measurements,  the  requirement  for  a  strong 
initial  solution,  and  the  effects  of  varying  the  level  of 
significance.  The  test  results  also  confirm  that  the  algor¬ 
ithm  works,  at  least  for  the  straight  line  model.  The  same 
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I  Table  4.  5  -  Data  Rejected  For  Each  Test 


i 

Test 

X  Coord.  Rej. 

Obs.  No 

FSTAT1 

FSTAT2 

2 

3.  25 

15 

3.  67 

3.  18 

] 

2 

3.  50 

16 

4.45 

3.  18 

2 

4.  25 

19 

7.  26 

3.  10 

2 

5.  50 

23 

7.  93 

3.  03 

i 

2 

6.  00 

25 

4.  48 

3.  01 

3 

4.  25 

19 

5.  06 

4.49 

i 

>• 

3 

5.50 

23 

5.  82 

4.  38 

5 

4.  25 

19 

5.  32 

4.45 

§ 

5 

5.  50 

23 

6.  20 

4.  35 

< 

L  „ 

6 

4.  25 

19 

7.  36 

4.  67 

j 

6 

5.  50 

23 

7.  00 

4.  96 

Data 

rejected  when 

FSTAT1  > 

FSTAT2. 

FSTAT1  =  Test  Statistic 

« 

FSTAT2  =  F(c*  ,p,rl) 

For  test  4,  the  F  values  for  X  =  4. 25  ,  5.5: 
X  =  4.25  FSTAT1=  3.91  FSTAT2  =  4.60 

( 


X  =  5.50  FSTAT1  =  4.72  FSTAT2  =  4.96 


data  points  are  consistently  rejected,  and  the  algorithm 
behaved  as  expected.  Once  the  initial  redundancy  was 
increased,  the  same  observations  were  rejected  when  they  were 
added  sequentially  in  reverse  order.  These  tests  are  not 
intended  to  be  rigorous  for  all  measurement  systems,  but  they 
do  illustrate  the  use  of  the  algorithm,  its  programmability, 
and  the  types  of  questions  that  must  be  answered  when  apply¬ 
ing  the  algorithm  to  a  specific  measuring  system. 


5. 


Concluding  Remarks 
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5. 1  Summary  of  the  Algorithm  Proposed 


The  stated  objective  of  this  thesis  was  to  develop  an 
algorithm  for  detecting  blunders  in  observed  data  using  a 
sequential  least  squares  adjustment  technique.  The  algorithm 
developed  consists  of  an  F-test  that  compares  two  key  quad¬ 
ratic  forms.  The  test  statistic, 

t  _-l 

[(  V  Q  V  )/P] 

2  22  2 

T.  S.  =  .  5.  1 

A  t  /s 

[(V  W  V  )/rl] 

111 

is  easily  computed  for  each  step  of  the  sequential  adjust¬ 
ment.  The  '  p’  observations  being  added  sequentially  are 
rejected  if  the  T.  S.  >  F(  ,p,rl).  Tests  on  measured  data 
sampled  along  a  straight  line  model  indicate  that  the  algor¬ 
ithm  works  well  on  that  simple  model  as  long  as  the  initial 
solution  contains  enough  redundant  observations  of  high 
weight  to  insure  the  test  is  valid  for  the  observations  added 
early  in  the  adjustment.  The  requirements  for  the  algorithm 
have  been  met  as  follows: 

1.  This  algorithm  is  suitable  for  detection  of  blunders 
of  small  and  large  magnitude.  Large  magnitude  refers 
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to  those  blunders  large  enough  to  be  detected  using 
non- statistical  methods  (El-Hakim  and  Zeiman,  1984). 
The  sequential  technique  does  not  require,  however, 
a  special  step  to  find  these  large  blunders  because 
they  will  be  rejected  before  they  contaminate  the 
solution  and  inflate  the  covariance  matrix  of  the 
parameter  estimates.  However,  the  initial  solution 
must  contain  enough  redundant  observations  of  high 
weight  to  insure  that  the  algorithm  works  well  early 
in  the  adjustment  for  blunders  of  small  magnitude. 

2.  Blunder  location  is  given  immediately. 

3.  The  algorithm  is  computationally  efficient.  It  com¬ 
bines  the  blunder  detection,  data  rejection,  and  the 
actual  adjustment  into  one  step.  The  simutaneous  ad¬ 
justment  requires  an  adjustment  followed  by  a  search 
for  blunders,  and  another  simultaneous  adjustment 
if  data  are  rejected  as  blunders. 

4.  Tolerances  may  be  determined  by  means  of  confidence 
intervals  using  the  a  posteriori  reference  variance 
that  is  updated  after  each  observation  is  added  in 
sequence.  Also,  the  maximum  undetectable  blunder  may 
be  calculated  for  the  initial  and  final  solutions  to 
insure  that  the  algorithm  works  well  early  in  the 
sequential  adjustment. 

5.  A  priori  knowledge  is  not  required  concerning  number 


or  location  of  blunders  in  the  observations  that 
are  added  sequentially.  However, the  presence  of  very 
large  numbers  of  blunders  relative  to  the  number  of 
parameters  indicates  a  problem  with  either  model  or 
data  transmission.  Also,  there  must  be  adequate 
blunder  free  data  to  form  a  good  initial  solution. 

6.  The  statistical  measures  of  confidence  for  Type  I 
and  Type  II  errors  are  incorporated  in  the  algorithm 
and  may  be  varied  to  meet  the  requirements  of  the 
user.  Type  I  errors  are  controlled  directly  through 
selecting  a  specific  level  of  significance.  Using 
a  higher  probability  of  a  Type  I  error  permits  one 
to  indirectly  control  the  probability  of  a  more 
expensive  Type  II  error. 

As  far  as  the  challenges  listed  in  Section  1. 1  are  concerned, 
they  can  be  largely  overcome.  The  inversion  of  large  matrices 
may  be  avoided  using  a  sequential  technique  beginning  with 
the  first  observation  and  can  permit  the  algorithm  to  be  used 
on  personal  computers.  The  fact  that  the  true  residuals  are 
not  available  for  each  step  of  the  adjustment  does  not  prohi¬ 
bit  the  detection  of  blunders.  As  demonstrated  in  Appendix  E, 
the  residuals  may  be  expressed  in  terms  of  residual  estimates 
at  each  step  of  the  sequential  adjustment,  allowing  the  use 
of  the  key  quadratic  forms  in  equations  2.6  and  2.7.  Finally, 


the  difficulty  in  establishing  the  initial  redundancy  and 
initial  solution  can  be  solved  rather  easily  for  the  simple 
straight  line  tested.  That  test  illustrated  that  it  is 
desireable  to  establish  an  initial  solution  strong  enough  to 
allow  the  algorithm  to  work  well  early  in  the  adjustment.  A 
strong  initial  solution  will  also  preclude  accepting  observa¬ 
tions  early  in  the  sequential  adjustment  that  would  be 
rejected  as  blunders  once  the  parameter  estimates  begin  to 
converge.  There  are  several  questions  concerning  the  initial 
solution  that  still  require  further  study.  These  questions 
will  be  briefly  discussed  in  Section  5. 3. 


5.  2  Applications  for  Sequential  Blunder  Detection 


It  is  important  to  note  that  the  sequential  adjustment  blun¬ 
der  detection  algorithm  may  be  used  in  place  of  any  other 
least  squares  adjustment  method.  Sequential  adjustments  pro¬ 
duce  results  identical  to  simultaneous  adjustments.  The  key 
advantage  for  the  sequential  approach  is  that  it  provides 
much  greater  flexibility  than  do  simultaneous  methods.  In 
cases  where  there  are  already  solutions,  particularly  those 
where  observations  have  been  discarded,  new  data  can  be 
tested  to  insure  that  the  old  solution  does  not  become  conta- 


minated  if  the  new  data  does  contain  blunders.  The  sequential 
approach  also  permits  blunder  detection  for  those  systems 
that  rely  on  a  sequential  adjustment  for  the  solution.  Orbit 
determination  is  one  such  problem.  There  are  also  many  oppor¬ 
tunities  to  employ  a  sequential  technique  to  on  line  measure¬ 
ment  systems.  Finally,  this  algorithm  is  statistically  based 
and  does  not  rely  on  subjective  judgement  that  may  or  may  not 
be  correct.  Artificial  techniques  such  as  a  three-sigma 
cutoff  or  estimating  Q  with  Q  of  the  observations  are  weak, 
have  no  statistical  basis,  and  should  not  be  used.  The  compu¬ 
ter  programs  for  sequential  adjustment  are  no  more  complex 
than  elimination  methods  and  offer  several  computational 
efficiencies  for  blunder  detection.  The  'artificial  methods' 
appeal  due  to  their  computational  efficiency  is  no  longer 
valid,  particularly  when  one  considers  that  their  computa¬ 
tional  efficiency  is  seriously  degraded  if  the  wrong  observa¬ 
tions  are  discarded. 

5. 3  Topics  for  Further  Study 

Most  of  the  difficulties  with  implementing  the  sequential 
adjustment  algorithm  involve  achieving  the  initial  solution. 
While  there  are  some  empirical  tests  that  may  be  used,  meth- 


ods  such  as  running  the  adjustment  in  reverse  order  are 
somewhat  wasteful  and  should  be  avoided  if  possible.  However, 
the  requirement  for  a  blunder  free  initial  solution  cannot  be 
taken  too  lightly  and  poses  the  following  questions: 

1.  Is  there  a  way  to  determine  initial  redundancy  with¬ 
out  resorting  to  empirical  testing? 

2.  Is  there  an  optimum  configuration  for  selecting  the 
data  points  for  the  initial  solution  for  various 
models  ? 

3.  How  does  one  insure  that  the  observations  in  the 
initial  solution  are  blunder  free? 

4.  Can  this  algoithm  be  extended  to  permit  checking  all 
of  the  observations  sequentially,  even  those  in  the 
initial  solution,  thus  permitting  a  true  on-line 
check  for  applications  in  real  time  mensuration. 

5.  Can  this  algorithm  be  extended  to  the  case  where 
parameters  as  well  as  observations  are  being  added? 

Finally,  there  are  several  questions  raised  that  are  not 
directly  related  to  the  initial  solution.  First,  the  algor¬ 
ithm  was  tested  on  a  simple,  linear  model.  Tests  should  be 
conducted  to  determine  if  there  is  a  requirement  to  alter  the 
test  statistic  for  cases  when  nonlinear  adjustment  problems 
require  more  than  one  iteration  to  converge.  Also,  the  ever 
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present  question  of  weights  persists.  In  addition  to  simply 
trying  to  identify  weights,  is  there  an  optimal  way  to 
sequentially  add  observations  having  higher  weights?  The 
answer  would  seem  to  be  that  the  more  heavily  weighted  obser¬ 
vations  should  be  added  as  early  as  possible  since  weight 
denotes  certainty  in  the  measurement.  It  is  desirable  to 
strengthen  the  solution  early  to  catch  early  blunders  more 
efficiently.  However,  there  are  questions  of  geometry,  dis¬ 
persion  or  range  of  the  data,  and  so  on,  that  may  require 
data  to  be  added  in  a  different  manner.  Many  of  these  ques¬ 
tions  will  require  testing  actual  measurement  systems  to  test 
the  limitations  and  capabilities  of  the  sequential  adjustment 
blunder  detection  algorithm. 
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Appendix  A  :  AbbreviatiQn?  and  Nptatipqg. 
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X  -  the  vector  of  estimates  of  the  parameters. 

L  -  the  vector  of  observations, 
e  -  the  vector  of  observational  errors. 

Y  -  the  vector  of  expectations  of  the  observations. 

V  -  the  vector  of  residuals. 

A  -  the  coefficient  matrix  of  the  observations. 

B  -  the  coefficient  matrix  of  the  parameters /unknowns, 
n  -  the  total  number  of  observations  in  a  least  squares 
solution. 

u  -  the  total  number  of  parameters/unknowns, 
r  -  the  redundant  number  of  observations  (r=n-u). 
b  -  the  maximum  allowable  number  of  blunders  in  a  data  set. 
W  -  the  weight  matrix  of  the  observations. 

Q  -  the  cofactor  matrix  of  the  observations  (equals  the 
inverse  of  W). 

Q  -  the  cofactor  matrix  of  the  residuals, 
c  -  the  number  of  condition  equations  in  a  least  squares 
solution. 

Ho  -  the  symbol  for  the  null  hypothesis  in  statistics. 

Ha  -  the  symbol  for  the  alternate  hypothesis. 

the  level  of  significance  of  a  statistical  test. 

t  -  is  the  symbol  for  transpose  of  an  individual  matrix, 
t 

B  is  B( transpose)  for  example. 

J  -  the  symbol  refering  to  the  Jacobian  of  a  matrix. 
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1.  The  general  least  squares  model  may  be  written  in  matrix 
form  as 


A(L+V)+BX=D 

(Bl) 

AV+BX=F 

(B2) 

where 

A  has  dimensions  c  by  n, 

D,F  have  dimensions  c  by  1, 

L,V  have  dimensions  n  by  1, 

B  has  dimensions  c  by  u, 

X  has  dimensions  u  by  1. 

2.  However,  the  technique  of  adjustment  by  indirect  observa 
tions  produces  results  identical  to  those  above  and  is 
denoted  by 


A(L+V)+BX=D 

or 

(L+V)+BX=D 

or 

V+  BX=F 

where 

A  is  n  by  n  identity 
B  has  dimensions 

X  has  dimensions 

F,L,V  have  dimensions 


( B3 ) 

(B4) 

(  B5  ) 

matrix, 
n  by  u, 
u  by  1, 
n  by  1. 


3.  Equation  (B3)  may  be  partitioned  to  denote  the  contri¬ 
bution  to  the  total  solution  by  components  of  a  sequen¬ 
tial  solution.  Subscript  (1)  refers  to  the  portion  of 
the  model  associated  with  observations  already  included 
in  the  solution.  Subscript  (2)  refers  to  the  *  p  '  new 
observations  being  added  sequentially  to  the  solution. 
In  matrix  notation. 


1  X 


where  the  matrices  have  the  following  dimensions: 


V  is  n  by  1 
1 

B  is  n  by  u 
1 

F  is  n  by  1 
1 

X  is  u  by  1 
2 


V  is  p  by  1 
2 

V  is  p  by  u 
2 

F  is  p  by  1 
2 


(  B6  ) 


and. 


X  is  the  vector  of  parameter  estimates  obtained 
1 

using  only  the  observations  L  . 

1 

X  is  the  vector  of  parameter  estimates  obtained 
2 


using  both  L  and  L  . 

1  2 
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4.  The  solution  for  X  in  equation  (B6)  is 

2 

-1 

X  =( N  )  [T  +  T  j 
2  2  12 


(B7  ) 


where 


t  t 

N  =[B  W  B  ♦  B  W  B  ] 
2  111  2  2  2 


(B8 ) 


where 


t  t 

[T  +  T  ]  =  [B  WF+B  WF] 
12  111222 

W  =  fw  0  “ 

1 

0  W 


W  has  dimensions  n  by  n, 
1 

W  has  dimensions  p  by  p. 
2 


(B9) 


5.  The  residual  vector  obtained  by  considering  only  L  is 

1 


V  +  B  X  =F 
1  111 


(BIO) 


where 


t  -It 

X  =[B  W  B  ]  [B  W  F  ] 
1  1  11  111 


(Bll) 


N  =[B  W  B  ] 

1  1  11 

6.  Notations  for  the  residuals 

a.  V  ,  V  refer  to  vectors  of  residuals  from  the  solution 
1  2 

of  B6.  L  and  L  are  included  in  the  solution  of  X  . 

12  2 


b.  V  refers  to  the  residuals  obtained  from  the  solution 
1 


of  BIO  when  only  L  is  included  in  the  solution  of  X  . 

1  1 
A 

c.  V  refers  to  the  discrepancies  equal  to  F  -  B  X  . 

2  2  2  1 


V  is  one  estimate  of  V  and  is  termed  the  discrepancy 
2  2 

vector. 

A  A 

d.  V  ,V  ,  V  are  all  residuals.  V  is  a  discrepancy  vector. 
12  1  2 


7.  Formulas  for  residuals  and  discrepancies  follow: 

V  =  F  -  B  X 
1  1  12 


(B12) 


V  =  F  -  B  X 
2  2  2  2 


(B13) 


V  =  F  -  B  X 
1  1  11 


(B14) 


V  =  F  -  B  X 
2  2  2  1 


(B15) 


1.  From  equations  (B8)  and  (Bll), 


a 


< 


a  '• 


t  t  t 

N  =[B  WB+B  WB]=[N+B  W  B  ]  (Cl) 

2111222  1222 

2.  The  inverse  of  N  can  easily  be  found  using  the  well 

2 

known  matrix  relationship:  (Mikhail,  1976) 

If  X=[ Y  +UZV]  ,  then  (C2) 

-1  -1  -1  -1  -1  -1  -1 
X  =  Y  -  Y  U[Z  +  V  Y  U]  VY  (C3) 

3.  Step  2  gives  the  following  result  for  N  : 

2 

-1  -1  -It  -It  -1  -1 

N  =  N  -  N  B  [Q+BNB  ]  BN  (C4) 

2  1  12  2  212  21 

-1  -1 

4.  Also,  letting  C  =  N  and  C  =  N  , 

11  2  2 

X  =  C  T  =  C  T  +  C  T' 

2  2  2  2  1  2  2 

t  t  -1 

CT  =  CT-CB  [BCB  +  Q  ]  BCT 
21  11  12  212  2  211 

t  t  t  -1  t 

C  T'=  C  [B  W  F  ]  -  C  B  [BCB  +  Q  ]  BCB  WF 
22  12  22  12  212  2  212  22 

t  t  t  t 

=  C  B  WF-CB  WF+CB  [BCB  +  Q  ]  Q  W  F 
1222  1222  12  212  2  222 


t  t  -1 

=  C  B  [BCB  +  Q  ]  F 
1  2  2  1  2  2  2 
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SO  , 


X  =  C  T  +  C  T' 

2  2  1  2  2 

t  t  -1 

=  C  T  +  C  B  [BCB  +  Q  ]  [F  -  B  C  T  ] 
11  12  212  2  2  211 


5.  From  step  4, 

t  t  -1 

X  =  X  +  C  B  [BCB  +  Q]  [F-BX]  (C5) 

2  1  12  212  2  221 

t  t  -1 

6.  The  quantity  CB  [BCB  «•  Q  ]  is  known  as  the 

12  2  12  2 

Kalman  gain  matrix  and  will  be  designated  K. 

7.  Therefore, 

X  =  X  +  K[F-BX]  (C6) 

2  1  2  2  1 

and 

-1  -1  -1 

N  =  N  -  K  B  N  ( C7 ) 

2  1  2  1 

8.  These  equations  can  be  generalized  to  a  form  where  (i-1) 
corresponds  to  subscript  (1)  and  (i)  corresponds  to  the 
subscript  (2).  However  the  subscripts  1,2  simplify  the 
notation  and  will  be  used  throughout  this  paper.  Any 
conclusions  made  concerning  examples  using  this 
notation  can  easily  be  extended  to  the  general  form. 

9.  One  may  also  show  that  observations  may  be  easily  deleted 

simply  by  changing  the  signs  of  the  key  matrix  W  . 

2 


(Mikhail,  1976) 


.ix  2  :  The  Cofactor  Matrix  of  the  Residuals. 


The  cofactor  matrix  of  the  residuals  may  be  expressed  in 
matrix  notation  (using  the  method  of  indirect  observa¬ 


tions)  as 


-It 

Q  =  [  Q  -  B  [N  ]  B  ] 
2 


(Dl) 


where  Q  is  block  diagonal. 

Q  may  be  partitioned  and  calculated  as  follows: 


C  '• 


Q  Q  I  'Q  0  ] 

11  12  1 

Q  =  -  -  = 

Q  Q  0  Q 

21  22  2 


r  i  r  -f 

B  I  ,B  ! 

1'  -1  11 

[N  ]  1 

B  2  B 

2  12 

L  _  L.  *5/ 


where 


(D2) 


Q  is  the  cofactor  matrix  for  the  residuals  V  . 
11  1 


Q  is  the  cofactor  matrix  for  the  residuals  V  . 
22  2 


Q  =  Q  =  the  cross-cofactor  matrix  for  V  and  V  . 
12  21  12 


-1  -1 
Q  =  W  and  Q  =  W  . 

11  2  2 

Step  2  shows  that  the  cofactor  matrix  for  the  residuals 

of  the  observations  being  added  sequentially  can  be 

calculated  at  each  step  since  the  inverse  of  N  is 

2 

calculated  each  time. 


-It 

Q  =  [  Q  -  B  [N  ]  B  ] 
22  2  2  2  2 


(D3) 
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1.  Let  V  +  B  X  =  F  be  partitioned  as 


i-  - 
r 


3 


— 

V 

B 

F 

1 

+ 

1 

X  = 

1 

V 

B 

F 

2 

L2_ 

2 

(El) 


2.  Now,  complete  a  sequential  reduction  by  eliminating  the 

observations  L  that  correspond  to  V  ,B  ,  and  F  . 

2  2  2  2 


1 


l- 


l 

Si 


L*. 

L* 


-It  t  -1  _-l 

3.  Let  N  =[B  WB  +  B  WB]  and  let  N  be  the 

2  111  222 


inverse  of  the  new  normal  equation  coefficient  matrix 

after  the  observations  L  are  deleted. 

2 


4.  Using  the  results  of  Appendix  C,  and  calling  the  new 
vector  of  parameter  estimates  X' , 


t  -1  t  -1 

X  =  X  -  N  B  [Q-BNB]  [F-BX]  (E2) 

1  22  2222  222 

5.  From  ( B6 ) , 


X  =  X  . 


6.  Also, 

_-l  t  t  t  -1-1 

N  =  [  B  WB+B  WB-B  WB]=N. 

Ill  222  222  1 


A 

7.  V  =  F 
1  1 


B  X  since  only  observations  L  are  part  of  the 
11  1 


solution,  (see  Appendix  B) 
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8.  Combining  (E2),  (D3),  and  step  7  above, 


a  -It  _-l 

V  =  F  -  B  X  +  [BN  B  ]Q  [F-BX] 

1  1  12  122  22  222 

9.  From  (B12)  and  (B13), 

F  -  B  X  =  V  and  F  -  B  X  =  V 
1121  2222 

-It 

10.  From  (D2)  BN  B  =  -Q  . 

12  2  12 

11.  Therefore, 

A  -  -‘I 

V  =  V  -  Q  Q  v 

1  1  12  22  2 

where 


(E3) 


(E4) 


V  refers  to  the  residuals  for  observations 
1 

L  after  the  effect  of  observations  L  have 
1  2 

been  removed  from  the  solution. 

12.  Taking  this  a  step  further,  a  relationship  between  the 

residuals  V  and  discrepancies  V  can  be  developed. 

2  2 

a  -1  -1  t  -1 

V  =  F  -  B  X’=  F  -  B  [X  -  N  B  [Q  -  B  N  B  ]  [F-BX]] 

2  2  2  2  22  222  222  2  22 

-1  t  _-l 

=  F  -  B  X  +  BN  B  Q  [F-BX] 

2  22  222  22  2  22 

-1  t  _-l 

=  [  I  +  B  N  B  Q  ]  V 
222  22  2 
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13.  And, 


-1  t  _-l 
I  +  B  N  B  Q 

2  2  2  22 


_-l 

I  +  (Q  -  Q  )  Q 

2  22  22 


_-l 

=  I  +  Q  Q  -I 
2  22 


_-l 

Q  Q 

2  22 


14.  Therefore, 


A  — "  —  A 

V  --  Q  Q  v  or  V  =  Q  W  V 
2  2  22  2  2  22  2  2 


€2  '• 


(E5) 


1.  Show  that 


s 

Li 


<• 


i 


►* 


£ 


t_-l  A  “  1  t  -1  a 

V  Q  V  =  V  [Q  +  B  N  B  ]  V 
2  22  2  22212  2 

where 


V  and  V  are  defined  as  in  (B13)  and  B(15). 
2  2 

2.  From  Appendix  E, 


A  — 

V  =  Q  Q  V  . 
2  2  22  2 

3.  Or, 


V  =  Q  W  V  . 

2  22  2  2 

4.  Substituting, 

t  _-l  a  ^  " 1  —  A 

V  Q  V  =  V  WQQ  QWV  (FI) 

2  22  2  2  2  22  22  22  2  2 

since  W  and  Q  are  symmetric  matrices. 

5.  The  left  side  of  equation  (Fl)  =  V  W  Q  W  V  (F2) 

2  2  22  2  2 


6.  From  (F2)  and  (D3), 


_  t 

WQ  W=W  (Q-BNB  )W 
2  22  2  2  2  222  2 

-1  t 

=  [I  -  W  B  N  B  ]  W 
2  2  2  2  2 

-1  t 

=  [W  -  W  B  N  B  W  ] 

2  2  2  2  2  2 


ft  l 
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3 


7.  Using  the  same  equation  used  in  (C2),  one  can  show  that 


-1  t  -1  -It 

[Q+BNB]  =  W  -  W  B  N  B  W. 
2  212  2  2222  2 


8.  Therefore, 


t  _-l 


-1  t  -1 


< 


V  Q  V  =  V  [Q+BNB] 
2  22  2  2  2212 


(F3) 


tf  o 


P 


I 


f- 


« 


Appendix  G  :  The  Total  Sum  of  Weighted  Squared  Residua] 


1.  The  purpose  of  this  appendix  is  to  show  the  following: 

(C 


t  A  t  A  t  _"1 

V  WV  =  V  W  V  +  V  QV 
1  1  1  2  22  2 


2.  The  left  of  (Gl)  can  be  partitioned  so  that 


t 


3.  From  Appendix  E, 


V  =  V  ♦  Q  Q  V 
1  1  12  22  2 


V  =  Q  W  V 
2  22  2  2 


A  -  A 

V  =  V  +  Q  W  V 
1  1  12  2  2 

4.  So , 

V  W  V  =  [V  +  Q  W  V  ]  W  [V  +  Q  W  V  ]  (G 

1  1  1  12  22  11  12  22 

A  ^  A  A*-  —  A  A  ^  —  A 

=  V  W  V  «■  V  WQWV+V  WQWV 
1  11  1  11222  2  2  21  11 

A  t  —  —  A 

+  V  WQWQWV 
2  2  21  1  12  2  2 

5.  However, 

t  A  —  A 

B  WV  =  QWV  =  0. 

1  11  21  1  1 
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6.  Therefore, 

t  a^aa^  —  —  a 

V  W  V  =  V  W  V  +  V  WQWQWV  (G4) 

1  11  1  11  2  2  21  1  12  22 

7.  Similarly, 

t  a  t  _  _  a 

V  W  V  =  V  WQ  WQ  WV  (G5) 

2  22  2  2  22  2  22  22 

8.  Combining  (G2),  (G4),  and  (G5), 

t  A  ^  A  At  —  —  A 

v  W  V  =  V  WV+V  WQWQWV  (G6) 

1  11  2  2  21  1  12  22 


+  V  WQWQWV 
2  2  22  2  22  2  2 

9.  From  (Gl)  and  (G6),  it  is  only  necessary  to  show  that 

G  =  V  fc  W  Q  WQ  WV  +  V  WQWQWV 
2  2  21  1  12  2  2  2  2  22  2  22  2  2 

t  _-l 

=  V  Q  V  .  ( G7 ) 

2  22  2 

10.  Using  the  Distributive  Law  for  matrix  multiplication, 

G  =  V  fc  [W  Q  WQ  W+WQ  WQ  W  ]  V  . 

2  2  21  1  12  2  2  22  2  22  2  2 

11.  Then, 

_  -1  -1  t 

WQ  WQ  W  =  WBN  NN  B  W 
2  21  1  12  2  222122  2 


(G8) 
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12.  And, 

_  -It  -It 

WQ  WQ  W  =  W[Q-BN  B  ]  W  [Q  -  B  N  B  ]W 
2  22  2  22  2  22  222  22  222  2 

-1  t  -1 

=  [W  -  W  B  N  B  W  ]  [I-  B  N  B  W  ] 

2  2222  2  2222 

-It  -It 

=  [W  -  W  B  N  B  W]-[WBNB  W] 

2  2222  2  2222  2 


-It  -It 

+  [W  BN  B  WBNB  W] 
2222  2222  2 


(G9) 


13.  Combining  (G8)  and  (G9)  then  gives 

A  t  -1  -1  t  -It  -It 

G  =  V  [WBN  N  N  B  W+WBN  B  WBN  B  W 
2  222  122  2  222222222 

-It  -1  t 

-WBNB  W  ♦  [W  -WBN  B  W]]V 
2222  2  2  2222  2  2 

A  t  -1  t  -It 

=  V  [WBN  [N  +  B  W  B  ]  N  B  W 
2  222  1  2  22  22  2 

-It  -1  t 

-WBNB  W  +  [W  -  W  B  N  B  W]]V 
2222  2  2  2222  2  2 


A  t  -1  t  A 

=  V  [W-WBNB  W]V 
2  2  2222  2  2 


(G10) 


14.  Using  (C2)  and  (C3)  it  can  be  shown  that 

-It  -It  -1 

[W-WBNB  W]=[Q+BN  B  ] 

2  2222  2  2  212 

At  “  1  t  -  1  a 

G  =  V  [  Q  *  B  N  B  ]  V 
2  2  2  1  2  2 


o  <>  <> 


Using  the  results,  then,  of  appendix  F 


t  A 
W  V  +  G 
1  11 

a  t  *1  t  A 

W  V  +  V  [Q  +  B  N  B  ]  V 
111  2  2212  2 

t  a  t  _-l 

WV+V  Q  V 
111  2  22  2 

which  proves  that  (Gl)  is  indeed  true. 


(Gil) 


ix  H  :  Independent  Ou; 


Forms. 


The  purpose  of  this  appendix  is  to  show  that 

A  t  A  t  1 

V  W  V  and  V  Q  V  are  independent. 
Ill  2  22  2 

The  first  step  in  demonstrating  their  independence  is  to 


demonstrate  that  V  and  V  are  independent. 

1  2 

3.  Returning  to  the  basic  principles  of  error  propagation, 


Q  =  J  Q  J 
XY  XZ  ZZ  YZ 


(HI) 


«  '• 


4.  For  this  problem, 

a  -It 

V  =  F-  BX  =  F-  BN  [B  WF] 
1  111111111 


(H2) 


-It  t 

V  =  F-  BX  =  F-  BN  [B  WF+B  WF]  (H3) 

2  2  22  2  22  1  11  2  22 


The  only  common  variable  is  F  and  the  crosscofactor 

1 

matrix  becomes 

t 

Qa  =  JA  Q  J  •  (H4) 

VV  VF  FF  VF 

12  1111  21 

For  indirect  observations  (observation  equations), 

-1 

Q  =  W 

F  F  1 

1  1 
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-1  t 

7.  JA  =  I  -  B  N  B  W  (H5 ) 

V  F  1111 

1  1 

-1  t 

J  =  B  N  B  W  (H6 ) 

V  F  2  2  1  1 

2  1 

8.  Substituting, 

t 

•7  A  Q  3 

V  F  F  F  V  F 

11  11  2  1 

-It  -1  -1  t  t 

=  [ I-B  N  B  W  ]  [W  ]  [B  N  B  W  ] 

1111  1  2211 

-1  -It  -It 

=  [  W  -  BN  B  ][WBN  B  ] 

1  111  1122 

-It  -It  -It 

=  [BN  B  ]-[BN  B  WBNB] 

122  111  1122 

-It  -It 

=  [B  N  B  ]  -  [B  N  B  ]  =  [0]  (H7 ) 

12  2  12  2 


9.  Therefore,  V  and  V  are  independent  and  their  respective 

1  2 

quadratic  forms  are  also  independent  because  W  and  Q 

1  22 

are  matrices  of  constants. 

10.  If  the  assumption  is  made  that  the  observational  errors 
are  normally  distributed  with  a  mean  of  0  and  a  variance 

i 

of  '"To  ,  then 


V  — N  (  0,1  A  A  ) 
1  V  V 

1  1 


(  H8  ) 


« 
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V  -~N  (  0,  Z  )  (H9) 

2  V  V 

2  2 


] 


11.  This  further  implies  that: 

a  t  a  ^ 

V  W  V  — ' CHI-SQUARE(r)  x  i; 

1  11 

t  1  z 

V  Q  V  -^CHI-SQUARE(p)  xX, 

2  22  2 


(H10) 

(Hll) 


where  r  and  p  are  the  respective  degrees  of  freedom  for 
for  the  CHI-SQUARE  distributions. 


4 


I 


P 


Appendix  X  :  Expected  Values  of  Quadratic  Forms 

1.  From  (Gl)  and  Appendix  H, 


t  A  t  A  t  _-l 

E[ V  W  V]  =  E[V  W  V  ]  +  E[ V  Q  V  ] 

111  2  22  2 


A  ^  A 

2.  Because  V  W  V  is  distributed  as  a  CHI-SQUARE, 
111 

A  t  A  _* 

E[ V  W  V  ]  =  (r)  x  1Z  . 

1  11 


t  _-l 

3.  Because  V  Q  V  is  distributed  as  a  CHI-SQUARE 
2  22  2 


t  _-l  a 

E[ V  Q  V  ]  =  (p)  xTo  . 

2  22  2 


4. 


This  leads  directly, 
t 

E[  V  W  V  ] 


then,  to 
=  (p  +  r)  xT.  . 


(ID 


(12) 


(13) 
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Appendix  J  :  Residuals  and  Observational  Errors. 

1.  Given  the  case  of  adjustment  by  indirect  observations, 
the  basic  model  may  be  written  as 

Y  +  B  X  -  D  (Jl) 

where 

Y  is  a  vector  of  the  expectations  of  the 
observations. 

X  is  the  vector  of  parameters. 

D  is  a  vector  of  constants. 

2.  The  equation  for  consistancy  of  (Jl)  is  (Stefanovic,  1978) 

t 

[I-  B  Q  B  W]  [D-Y]  =  0  ( J2) 

xx 

where  Q  is  the  cofactor  matrix  for  X. 
xx 

3.  Y  can  be  expressed  as  a  function  of  observational  errors 
(e)  and  observations  (L)  by  Y  =  L  +  e. 

4.  Then,  from  step  3, 

t 

[I  -  B  Q  B  W]  [D  -  (L+e)]  =  0  (J3) 

xx 

5.  Steps  3  and  4  lead  to 

t  t 

[I  -  B  Q  B  W]  [D  -  L]  =  -[I  -  B  Q  B  W]  e.  (J4) 

XX  XX 

6.  Since  F  =  D  -  L, 

t  t 

[I-BQB  W]F=-[I-BQB  W]e  (J5) 

XX  XX 

7.  The  least  squares  estimate  of  the  residuals  is 

t 

V  =  [  I  -  B  Q  B  W]F  ( J6 ) 

xx 


71 


72 


8.  From 


9.  J8  i 


i 


I  ^ 


I 


4 


4 


4 


steps  6  and  7 , 

t 

V  =  -  [I  -B  Q  B  W]e 
xx 

t 

=  -  [Q  -B  Q  B  ]  e 
xx 

=  -  Q  W  e 

s  a  key  relationship  for  several  reasons. 

a.  E(e  )=0  implies  E(V  )=0  as  well. 

i  i 

2.  -  i 

b.  Var(e  )  =  ^  implies  that  Var(V)=QY0 

i 

where  V  is  the  vector  of  residuals. 


(J7) 


(  J8  ) 


Step  1:  Read  the  number  of  parameters,  the  initial 
redundancy,  the  number  of  observations  to  be  added  sequen¬ 
tially,  the  level  of  signif icance( alpha) ,  and  the  sequential 
file  tape  to  be  used  depending  on  whether  the  data  is  to  be 
added  in  forward  or  reverse  order. 

Step  2:  Read  each  observation,  its  observation  equa¬ 
tion  coefficients,  the  observation  weight,  and  the  observa¬ 
tion  equation  constant. 

Step  3:  Write  two  sequential  files.  The  first  file 
will  contain  the  observations  to  be  added  sequentially  in  the 
order  listed  in  the  data  set.  The  second  sequential  file  will 
contain  the  same  data  but  in  reverse  order.  (See  chapter  4). 

Step  4:  Now  complete  an  initial  solution  using  the 
first  (u+rl)  observations.  These  observations  are  not 
included  in  either  of  the  sequential  files  formed  in  step  3. 

Step  5:  Add  'p'  observations  to  the  sequential  adjust¬ 
ment.  Compute  the  new  parameter  estimates,  the  new  parameter 
estimate  coefficient  matrix,  the  quantity  in  equation  F3. 

Step  6:  Compute  the  test  statistic  proposed  in  section 


4 


74 

Step  7:  Compare  the  test  statistic  to  the  F-statistic 
with  the  hypothesis  test  in  section  2.4. 

Step  8:  If  the  observation  is  accepted,  go  to  step  5 
and  add  another  'p'  observations  until  all  of  the  observa¬ 
tions  have  been  considered. 

Step  9:  Update  the  redundancy,  the  sum  of  the 
weighted,  squared  residuals  (using  the  equation  in  Gil)  to 
reflect  the  observations  added  and  accepted  in  the  sequential 
adjustment. 

Step  10:  If  the  observations  are  rejected,  perform 
sequential  adjustment  to  delete  the  rejected  observations, 
then  go  to  step  5  and  add  another  ’p’  observations  until  all 
observations  have  been  considered. 


« 


This  appendix  contains  the  computer  program  developed 
to  impliment  the  sequential  least  squares  adjustment  blunder 
detection  algorithm.  The  programming  language  is  Fortran  77. 
The  computer  system  used  was  an  IBM  -  370.  TL«?  subroutines 
contain  program  logic  at  the  beginning  of  the  routine. 
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FSTATl  =  IOELTAU)/H)/{  Sl/KII 
FSTAT2=1 

CALL  FTEST2(FSTAT2,P.K1 .ALPHA, TABLE  I 
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